Belief revision

Results: 146



#Item
21. REINFORCEMENT LEARNING FOR LIVE MUSICAL AGENTS Nick Collins University of Sussex ABSTRACT Current research programmes in computer music may

Source URL: composerprogrammer.com

Language: English - Date: 2014-05-01 12:44:10
22. de Budgeted Classification-based Policy Iteration presented by Victor Gabillon

Source URL: victorgabillon.nfshost.com

Language: English - Date: 2015-07-14 00:09:21
23. Journal of Experimental Psychology: Learning, Memory, and Cognition 2005, Vol. 31, No. 2, 374–377 Copyright 2005 by the American Psychological Association/$12.00 DOI:

Source URL: www.stefanfrank.info

Language: English - Date: 2012-06-04 17:12:23
24. A Few Arguments against Counterfactual Accounts of Causation Haitao Cai Department of Linguistics, University of Pennsylvania The notion of causation is intimately related to that of counterfactuals. For example, an even

Source URL: www.macsim.us

Language: English - Date: 2013-04-02 10:18:48
25. SWIRL: A Sequential Windowed Inverse Reinforcement Learning Algorithm for Robot Tasks With Delayed Rewards Sanjay Krishnan, Animesh Garg, Richard Liaw, Brijen Thananjeyan, Lauren Miller, Florian T. Pokorny, Ken Goldb

Source URL: goldberg.berkeley.edu

Language: English - Date: 2016-07-21 11:29:30
26. Personalized Ad Recommendation Systems for Life-Time Value Optimization with Guarantees Georgios Theocharous Adobe Research

Source URL: psthomas.com

Language: English - Date: 2015-05-02 18:41:02
27. Sutton, Richard PIN

Source URL: webdocs.cs.ualberta.ca

Language: English - Date: 2013-10-18 16:05:54
28. Boosted Bellman Residual Minimization Handling Expert Demonstrations Bilal Piot, Matthieu Geist, Olivier Pietquin

Source URL: www.metz.supelec.fr

Language: English - Date: 2014-07-15 03:12:51
29. Language Understanding for Text-based Games using Deep Reinforcement Learning Karthik Narasimhan CSAIL, MIT

Source URL: www.emnlp2015.org

Language: English - Date: 2015-09-04 01:25:56
30. General Reinforcement Learning Jan Leike Future of Humanity Institute University of Oxford 9 June 2016

Source URL: intelligence.org

Language: English - Date: 2016-06-10 12:39:29